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Abstract 

One of the classic results in scheduling theory is the 2-approximation algorithm by Lenstra, 
Shmoys, and Tardos for the problem of scheduling jobs to minimize makespan on unrelated 
machines, i.e., job j requires time pij if processed on machine i. More than two decades after 
its introduction it is still the algorithm of choice even in the restricted model where processing 
times are of the form £ {pj,oo}. This problem, also known as the restricted assignment 
problem, is NP-hard to approximate within a factor less than 1.5 which is also the best known 
lower bound for the general version. 

Our main result is a polynomial time algorithm that estimates the optimal makespan of 
the restricted assignment problem within a factor 33/17 + e « 1.9412 + e, where e > is an 
arbitrarily small constant. The result is obtained by upper bounding the integrality gap of a 
certain strong linear program, known as configuration LP, that was previously successfully used 
for the related Santa Claus problem. Similar to the strongest analysis for that problem our 
proof is based on a local search algorithm that will eventually find a schedule of the mentioned 
approximation guarantee, but is not known to converge in polynomial time. 

1 Introduction 

Scheduling on unrelated machines is the model where we are given a set J of jobs to be processed 
without interruption on a set M of unrelated machines, where the time a machine i G A4 needs to 
process a job j € J is specified by a machine and job dependent processing time pij > 0. When 
considering a scheduling problem the most common and perhaps most natural objective function 
is makespan minimization. This is the problem of finding a schedule, also called an assignment, 
a : J i— > M. so as to minimize the time maxj g _A4 Ylj<=a- 1 (i)Pij required to process all the jobs. 

A classic result in scheduling theory is Lenstra, Shmoys, and Tardos' 2-approximation algorithm 
for this basic problem |13j . Their approach is based on several nice structural properties of the 
extreme point solutions of a natural linear program and has become a text book example of such 
techniques (see, e. g., [IB])- Complementing their positive result they also proved that the problem is 
NP-hard to approximate within a factor less than 1.5 even in the restricted case when p^j E {pj, oo} 
(i. e., when job j has processing time pj or oo for each machine). This problem is also known as the 
restricted assignment problem and, although it looks easier than the general version, the algorithm 
of choice has been the same 2-approximation algorithm as for the general version. 

Despite being a prominent open problem in scheduling theory, there has been very little progress 
on either the upper or lower bound since the publication of |13j over two decades ago. One of the 
biggest hurdles for improving the approximation guarantee has been to obtain a good lower bound 
on the optimal makespan. Indeed, the considered linear program has been useful for generalizations 
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such as introducing job and machine dependent costs |16| [T7] but is known to have an integrality 
gap of 2 — 1/| M | even in the restricted case. We note that Shchepin and Vakhania |15j presented 
a rounding achieving this gap slightly improving upon the approximation ratio of 2. 

In a relatively recent paper, Ebenlendr et al. [6] overcame this issue in the special case of the 
restricted assignment problem where a job can be assigned to at most two machines. Their strategy 
was to add more constraints to the studied linear program, which allowed them to prove a 1.75- 
approximation algorithm for this special case that they named Graph Balancing. The name arises 
naturally when interpreting the restricted assignment problem as a hypergraph with a vertex for 
each machine and a hyperedge T(j) = {i £ Ai : pij = pj} for each job j £ J that is incident to the 
machines it can be assigned to. As pointed out by the authors of [6] it seems difficult to extend 
their techniques to hold for more general cases. In particular, it can be seen that the considered 
linear program has an integrality gap of 2 when we allow jobs that can be assigned to 3 machines. 

In this paper we overcome this obstacle by considering a certain strong linear program, often 
referred to as configuration LP. In particular, we obtain the first asymptotic improvement on the 
approximation factor of 2. 

Theorem 1.1 There is a polynomial time algorithm that estimates the optimal makespan of the 
restricted assignment problem within a factor of 33/17 + e ~ 1.9412 + e, where e > is an arbitrarily 
small constant. 

We note that our proof gives a local search algorithm to also find a schedule with performance 
guarantee || but it is not known to converge in polynomial time. 

Our techniques are based on the recent development on the related Santa Claus problem. In 
the Santa Claus problem we are given the same input as in the considered scheduling problem 
but instead of wanting to minimize the maximum we wish to maximize the minimum, i.e., to 
find an assignment a so as to maximize minj e _A/i Sj go — Vij- The playful name now follows from 
associating the machines with kids and jobs with presents. Santa Claus' problem then becomes to 
distribute the presents so as to make the least happy kid as happy as possible. 

The problem was first considered under this name by Bansal and Sviridenko [3]. They for- 
mulated and used the configuration LP to obtain an 0(loglog log \M. \j log log ^-approximation 
algorithm for the restricted Santa Claus problem, where pij £ {pj,0}. They also proved several 
structural properties that were later used by Feige [7] to prove that the integrality gap of the con- 
figuration LP is in fact constant in the restricted case. The proof is based on repeated use of Lovasz 
local lemma and was only recently turned into a polynomial time algorithm [9]. 

The approximation guarantee obtained by combining [7] and [9] is a large constant and the 
techniques do not seem applicable to the considered problem. This is because the methods rely on 
structural properties that are obtained by rounding the input and such a rounding applied to the 
scheduling problem would rapidly eliminate any advantage obtained over the current approximation 
ratio of 2. Instead, our techniques are mainly inspired by a paper of Asadpour et al. pQ who gave a 
tighter analysis of the configuration LP for the restricted Santa Claus problem. More specifically, 
they proved that the integrality gap is lower bounded by 1/4 by designing a local search algorithm 
that eventually finds a solution with the mentioned approximation guarantee, but is not known to 
converge in polynomial time. 

Similar to their approach, we formulate the configuration LP and show that its integrality gap 
is upper bounded by 33/17 by designing a local search algorithm. As the configuration LP can 
be solved in polynomial time up to any desired accuracy [3], this implies Theorem |1.1| Although 
we cannot prove that the local search converges in polynomial time, our results imply that the 
configuration LP gives a polynomial time computable lower bound on the optimal makespan that 
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is strictly better than two. We emphasize that all the results related to hardness of approximation 
remain valid even for estimating the optimal makespan. 

Before proceeding, let us mention that unlike the restricted assignment problem, the special 
case of uniform machines and that of a fixed number of machines are both significantly easier to 
approximate and are known to admit polynomial time approximation schemes [10\ [TT| [T2] . Also 
scheduling jobs on unrelated machines to minimize weighted completion time instead of makespan 
has a better approximation algorithm with performance guarantee 1.5 [14]. The Santa Claus 
problem has also been studied under the name Max-Min Fair Allocation and there have been 
several recent results for the general version of the problem (see e.g. [21111 IS])- 

Compared to [I], our analysis is more complex and relies on the special structure of the dual 
of the linear program. To illustrate the main techniques, we have therefore chosen to first present 
the analysis for the case of only two job sizes (Section [3]) followed by the general case in Section |4l 



2 Preliminaries 

As we consider the restricted case with pij £ {pj,oo}, without ambiguity, we refer to pj as the size 
of job j. For a subset J' C J we let p{J') = Yljej'Po ano - often write p(j) for p({j}), which of 
course equals pj. 

We now give the definition of the configuration LP for the restricted assignment problem. Its 
intuition is that a solution to the scheduling problem with makespan T assigns a set of jobs, referred 
to as a configuration, to each machine of total processing time at most T. Formally, we say that a 
subset C C J of jobs is a configuration for a machine i £ A4 if it can be assigned without violating 
a given target makespan T, i.e., C C {j : i £ r(j)} and p(C) < T. Let C(i,T) be the set of 
configurations for machine j £ M with respect to the target makespan T. The configuration LP 
has a variable xic for each configuration C for machine i and two sets of constraints: 

[C-LP] Yj x i,c < 1 ieM 

CeC(i,T) 
C^j i 

x > 

The first set of constraints ensures that each machine is assigned at most one configuration and 
the second set of constraints says that each job should be assigned (at least) once. 

Note that if [C-LP] is feasible with respect to some target makespan To then it is also feasible 
with respect to all T > T$. Let OPT^p denote the minimum over all such values of T. Since 
an optimal schedule of makespan OPT defines a feasible solution to [C-LP] with T = OPT, we 
have OPTlp < OPT. To simplify notation we will assume throughout the paper that OPTlp = 1 
and denote C(i,l) by C(i). This is without loss of generality since it can be obtained by scaling 
processing times. 

Although [C-LP] might have exponentially many variables, it can be solved (and OPTlp can be 
found by binary search) in polynomial time up to any desired accuracy e > [3J. The strategy of [3] 
is to design a polynomial time separation oracle for the dual and then solve it using the ellipsoid 
method. To obtain the dual, we associate a dual variable yi with i G Ai for each constraint from 
the first set of constraints and a dual variable Zj with j G J for each constraint from the second 
set of constraints. Assuming that the objective of [C-LP] is to maximize an objective function with 
zero coefficients then gives the dual: 
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Dual of [C-LP] 



min ^2 Vi ~ ^2 



i£M jej 



i€M,C£C{i) 



y,z>0 



Let us remark that, given a candidate solution (y* , z*), the separation oracle has to find a violated 
constraint if any in polynomial time and this is just m knapsack problems: for each i £ Ai solve 
the knapsack problem with capacity 1 and an item with weight pj and profit zj for each j E 3 
with % 6 r(j). By rounding job sizes as explained in [3], we can thus solve [C-LP] in polynomial 
time up to any desired accuracy. 

3 Overview of Techniques: Jobs of Two Sizes 

We give an overview of the main techniques used by considering the simpler case when we have jobs 
of two sizes: small jobs of size e and big jobs of size 1. Already for this case all previously considered 
linear programs have an integrality gap of 2. In contrast we show the following for [C-LP]. 

Theorem 3.1 // an instance of the scheduling problem only has jobs of sizes e > and 1 then 
[C-LP] has integrality gap at most 5/3 + e. 

Throughout this section we let R = 2/3 + e. The proof strategy is to design a local search 
algorithm that returns a solution with makespan at most 1 + R, assuming the [C-LP] is feasible. 
The algorithm starts with a partial schedule a with no jobs assigned to any machine. It will then 
repeatedly call a procedure, summarized in Algorithm [TJ that extends the schedule by assigning 
a new job until all jobs are assigned. When assigning a new job we need to ensure that a will 
still have a makespan of at most 1 + R. This might require us to also update the schedule a by 
moving already assigned jobs. For an example, consider Figure [T] where we have a partial schedule 
and wish to assign a new big job j new . In the first step we try to assign j new to Mi but discover 
that Mi has too high load, i.e., the set of jobs assigned to Mi have total processing time such that 
assigning j new to Mi would violate the target makespan 1 + R. Therefore, in the second step we 
try to move jobs from Mi to M2 but M2 has also too high load. Instead, we try to move j new to 
M3. As M3 already has a big job assigned we need to first reassign it. We try to reassign it to M4 
in the fourth step. In the fifth step we manage to move small jobs from M4 to M3, which makes it 
possible to also move the big job assigned to M3 to M4 and finally assign j new to M3. 

Valid schedule and move. As alluded to above, the algorithm always maintains a valid partial 
schedule by moving already assigned jobs. Let us formally define these concepts. 

Definition 3.2 A partial schedule is an assignment a : J 1— > M. U {TBD} with the meaning that 
a job j with o~(j) = TBD is not assigned. A partial schedule is valid if each machine i G Ai is 
assigned at most one big job and p(a~ 1 (i)) < 1 + R. 

That i is assigned at most one big job is implied here by p(a~ 1 (i)) < 1 + R but will be used for 
the general case in Section [4} Note also that with this notation a normal schedule is just a partial 



schedule a with a' 1 (TBD) = 0. 
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Figure 1: Possible steps when moving jobs to assign a new job j new - Big and small jobs depicted 
in dark and light grey, respectively. 

Definition 3.3 A move is a tuple (J,i) of a job j € J and a machine i G T a (j), where T a (j) = 
T(j) \ {o~(j)} denotes the machines to which j can be assigned apart from cr(j'). 

The main steps of the algorithm are the following. At the start it will try to choose a valid 
assignment of jnew to a machine, i.e., that can be made without violating the target makespan. If 
no such assignment exists then the algorithm adds the set of jobs that blocked the assignment of 
jnew to the set of jobs we wish to move. It then repeatedly chooses a move of a job j from the set 
of jobs that is blocking the assignment of jnew If the move of j is valid then this will intuitively 
make more space for j new . Otherwise the set of jobs blocking the move of j is added to the list of 
jobs we wish to move and the procedure will in the next iteration continue to move jobs recursively. 

To ensure that we will be able to eventually assign a new job j new it is important which moves 
we choose. The algorithm will choose between certain moves that we call potential moves, defined 
so as to guarantee (i) that the procedure terminates and that (ii) if no potential move exists then 
we shall be able to prove that the dual of [C-LP] is unbounded, contradicting the feasibility of the 
primal. For this reason, we need to remember to which machines we have already tried to move 
jobs and which jobs we wish to move. We next describe how the algorithm keeps track of its history 
and how this affects which move we choose. We then describe the types of potential moves that 
the algorithm will choose between. 

Tree of blockers. To remember its history, Algorithm [j] has a dynamic tree T of so-called 
blockers that "block" moves we wish to do. Blockers of T have both tree and linear structure. The 
linear structure is simply the order in time the blockers were added to T ■ To distinguish between 
the two we will use child and parent to refer to the tree structure; and after and before to refer to 
the linear structure. We also use the convention that the blockers Bq, B±, . . . , Bt of T are indexed 
according to the linear order. 

Definition 3.4 A blocker B is a tuple that contains a subset J{B) C J of jobs and a machine 
A4(B) that takes value _L if no machine is assigned to the blocker. 

To simplify notation, we refer to the machines and jobs in T by Ai(T) and J(T), respectively. 
We will conceptually distinguish between small and big blockers and use Ms(T) an d Mb{T) to 
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refer to the subsets of A4(T) containing the machines in small and big blockers, respectively. To be 
precise, this convention will add a bit to the description of a blocker so as to keep track of whether 
a blocker is small or big. 

The algorithm starts by initializing the tree T with a special small blocker B as root. Blocker B 
is special in the sense that it is the only blocker with no machine assigned, i.e., M(B) = _L. Its job 
set J{B) includes the job j new we wish to assign. The next step of the procedure is to repeatedly 
try to move jobs, until we can eventually assign j new . During its execution, the procedure also 
updates T based on which move that is chosen so that 

1. Ms(T) contains those machines to which the algorithm will not try to move any jobs; 

2. Mb(T) contains those machines to which the algorithm will not try to move any big jobs; 

3. J(T) contains those jobs that the algorithm wishes to move. 

Potential moves. For a move (j, i) to be useful it should be of some job j G J(T) as this set 
contains those jobs we wish to move to make space for the unassigned job j new . In addition, the 
move (j, i) should have a potential of succeeding and be to a machine i where j is allowed to be 
moved according to T ■ We refer to such moves as "potential moves and a subset of them as valid 
moves. The difference is that for a potential move to succeed it might be necessary to recursively 
move other jobs whereas a valid move can be done immediately. With this intuition, let us now 
define these concepts formally. 

Definition 3.5 A move (j, i) of a job j G J{T) is a potential 
small move: if j is small and i G" A4s(T); 

big-to-small move: if j is big, i G" M(T),p(Si) < R, and no big job is assigned to i; 
big-to-big move: if j is big, i G" M(T),p(Si) < R, and a big job is assigned to i; 

where Si = {j G : j is small with T a (j) C Ms(T)}. A potential move is valid if the 

update a(j) <— i results in a valid schedule. 

Note that <Sj refers to those small jobs assigned to % with no potential moves with respect to the 
current tree. The condition p(Si) < R for big moves enforces that we do not try to move big jobs to 
machines where the load cannot decrease to at most R without removing a blocker already present 
in T . The algorithm's behavior depends on the type of the chosen potential move, say (j, i) of a 
job j G J{B) for some blocker B: 

• If (J, i) is a valid move then the schedule is updated by a(j) <— i. Moreover, T is updated by 
removing B and all blockers added after B. This will allow us to prove that the procedure 
terminates with the intuition being that B blocked some move (j',i r ) that is more likely to 
succeed now after j was reassigned. 

• If (j, i) is a potential small or big-to-small move that is not valid then the algorithm adds a 
small blocker B$ as a child to B that consists of the machine i and contains all jobs assigned 
to i that are not already in T ■ Note that after this, since B$ is a small blocker no other jobs 
will be tried to be moved to i. The intuition of this being that assigning more jobs to % would 
make it less likely to be able to assign j to i in the future. 
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• If (j, i) is a potential big-to-big move then the algorithm adds a big blocker Bb as child to B 
that consists of the machine i and the big job that is assigned to i. Since Bb is a big blocker 
this prevents us from trying to assign more big jobs to i but at the same time allow us to try 
to assign small jobs. The intuition being that this will not prevent us from assigning j to i if 
the big job currently assigned to i is reassigned. 

We remark that the rules on how to update T are so that a job can be in at most one blocker 
whereas a machine can be in at most two blockers (this happens if it is first added in a big blocker 
and then in a small blocker). 

Returning to the example in Figure [TJ we can see that after Step 4, T consists of the special 
root blocker with two children, which in turn have a child each. Machines Mi, M2 and M4 belong 
to small blockers whereas M3 belongs to a big blocker. Moreover, the moves chosen in the first, 
second, and the third step are big-to-small, small, and big-to-big, respectively, and from Step 5 to 
6 a sequence of valid moves is chosen. 



Values of moves. In a specific iteration there might be several potential moves available. For 
the analysis it is important that they are chosen in a specific order. Therefore, we assign a vector 
in M 2 to each move and Algorithm [l] will then choose the move with minimum lexicographic value. 

Definition 3.6 // we let Li = cr _1 (z) then a potential move (j,i) has value 



ValCM) 



(0, 0) if valid, 

(l,p(Li)) if small move, 

(2,p(Li)) if big-to-small, 

(3, 0) if big-to-big, 



Note that as the algorithm chooses moves of minimum lexicographic value, it always chooses a 
valid move if available and a potential small move before a potential move of a big job. 



The algorithm. Algorithm [T] summarizes the algorithm discussed above in a concise definition. 
Given a valid partial schedule a and an unscheduled job j new , we prove that the algorithm preserves 
a valid schedule by moving jobs until it can assign j new . Repeating the procedure by choosing a 



unassigned job in each iteration until all jobs are assigned then yields Theorem 3.1 



3.1 Analysis 

Since the algorithm only updates a if a valid move was chosen we have that the schedule stays 
valid throughout the execution. It remains to verify that the algorithm terminates and that there 
always is a potential move to choose. 

Before proceeding with the proofs, we need to introduce some notation. When arguing about 
T we will let 

• % be the subtree of T induced by the blockers Bq, B\, . . . , Bt] 

• S(7t) = {j € J : j is small with T a (j) C A4s(Tt)} and often refer to S(T) by simply S; and 

• Si{T t ) = o-^i) n S(T t ) (often refer to Sj(T) by Si). 
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Algorithm 1 SimpleExtendSchedule(<7, j T 



Initialize T with the root J(B) = {j new } and Ai(B) = _L 
while a (j n ew) is TBD do 

Choose a potential move (j, i) with j 6 <J(T) of minimum lexicographic value 

Let B be the blocker in T such that j G J(B) 

if (j, «) is valid then 

Update the schedule by a(j) <— i 

Update T by removing B and all blockers added after B 
else if (j, i) is either a potential small move or a potential big-to-small move then 

Add a small blocker B s as child to B with M[B S ) = i and J{B S ) = a' 1 ® \ J(T) 
else {(j, i) is a big-to-big move} 

Let j'b be the big job such that <t(j'b) = i 

Add a big blocker I?b as a child to I? with J(Bb) = {jb} and M(Bb) = 2 
end if 
end while 
return a 



The set S(%) contains the set of small jobs with no potential moves with respect to %■ Therefore 
no job in S(Tt) has been reassigned since Tt became a subtree of T, i.e., since B t was added. We can 
thus omit the dependence on a when referring to S(Tt) and Si(Tt) without ambiguity. A related 
observation that will be useful throughout the analysis is the following. No job in a blocker B of 
T has been reassigned after B was added since that would have caused the algorithm to remove B 
(and all blockers added after B). 

We now continue by first proving that there always is a potential move to choose if [C-LP] is 



feasible followed by the proof that the procedure terminates in Section 3.1.2 



3.1.1 Existence of potential moves 

We prove that the algorithm never gets stuck if the [C-LP] is feasible. 

Lemma 3.7 If [C-LP] is feasible then Algorithm^ can always choose a potential move. 

Proof. Suppose that the algorithm has reached an iteration where no potential move is available. 
We will show that this implies that the dual of [C-LP] is unbounded and we can thus deduce as 
required that the primal is infeasible in the case of no potential moves. 

As each solution (y, z) of the dual can be scaled by a scalar a to obtain a new solution (ay, az), 
any solution such that X^ieX < ^jeJ z i i m P nes unboundedness. We proceed by defining such 
a solution (y* , z*): 

(2/3 if j eJ(T) is big, 

Pj = e if j € J(J~) U S is small, 
otherwise, 



Z 3 



and 

fl itieMs(T), 
\E je *-m) z j otherwise. 
Let us first verify that (y* , z*) is indeed a feasible solution. 

Claim 3.8 Assuming no potential moves are available, (y*,z*) is a feasible solution. 
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Proof of Claim. We need to verify that y* > Yljec z j ^ or eacn * £ M and each C G C(i). Recall 
that the total processing time of the jobs in a configuration is at most 1. Also observe that z* = 
for jobs not in J(T) U S and we can thus omit such jobs when verifying the constraints. 

Since z* < pj for all j G J , we have that no constraint involving the variable y* for i G Ms(T) 
is violated. Indeed for such a machine i we have ?/* = 1 and ^jeC(«) z i — ^2jeC(i) Pj — 1 f° r an y 
CeC(i). 

As a small job j G J(T) with a move (j, i) is a potential move if i G" Ms(T) and no such moves 
exist by assumption, no small jobs in J(T) can be moved to machines in M. \ M.s(T). Also, by 
definition, no small jobs in S can be moved to a machine i Ms- This together with the fact that 
a big job js has processing time 1 and is thus alone in a configuration gives us that a constraint 
involving y* for i G" Ms(T) can only be violated if y* < z* B =2/3. 

As a machine % G Mb(T) has a big job assigned, we have that for those y* > 2/3. Now consider 
the final case when i A4(T). If a big job in J{T) has a move to i then since it is not a potential 
move p(S'i) > R > 2/3. As z* = pj = e for small jobs, we have then y* = Yljeo- 1 ^) z j — Pi^i) > 
2/3, as required. 

We can thus conclude that no constraint is violated and (y* ,z*) is a feasible solution. □ 



Having proved that (y*, z*) is a feasible solution, the proof of Lemma 3.7 is now completed by 
showing that the value of the solution is negative. 

Claim 3.9 We have that ^2 ieM y* < Y^jej z j- 

Proof of Claim. By the definition of y* , 

2>r= E i+ E E A d) 

We proceed by bounding ^iex s (T) 1 f rom above by 

E E * 

i&M s {T) iea-^ii) 

Let Bq, B\, . . . , Bi be the blockers of T and consider a small blocker St for some t = 1, . . . ,£. 
By the definition of Algorithm [TJ was added in an iteration when either a potential small or 
big-to-small move (jo, it) was chosen with Ai(Bt) = it- Suppose first that (jo, it) was a potential 
small move. Then as it was not valid, p(jo) +p(a~ 1 (it)) > 1 + R. This inequality together with the 
fact that it is assigned at most one big job jg gives us that if (jo, it) is a small move then 

£ z*=p(a- x {i t ))-{p(j B )-z* B )>l + R-p(jo)-\ = \. (2) 

On the other hand, if (jo, it) is a potential big-to-small move then as it was not valid 

\ < R < p(a-\i t )) = Y, z h (3) 

where the equality follows from that it is only assigned small jobs (since we assumed (jo, it) was a 
big-to-small move). 

From Q and Q we can see that E^j e x s (7-) 1 is bounded from above by ^2 ieMs ^ Y^jea- 1 ^) z j 
if the number of small blockers added because of small moves is greater than the number of small 
blockers added because of big-to-small moves. 
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We proceed by proving this by showing that if Bt is a small blocker added because of a potential 
big-to-small move then Bt+i must be a small blocker added because of a small move. Indeed, the 
definition of a potential big-to-small move (jo, it) and the fact that it was not valid imply that 

p(S it (Tt))<R and p(a-\i t )) > R. 

As there are no big jobs assigned to it (using that (jo, it) was a big-to-small move), the above 
inequalities give us that there is always a potential small move of a small job assigned to it with 
respect to It- In other words, we have that Bt was not the last blocker added to T and as small 
potential moves have the smallest lexicographic value (apart from valid moves), Bt+\ must be a small 
blocker added because of a small move. We can thus "amortize" the load of Bt+\ to increase the load 
of B t . Indeed, if we let M(B t+1 ) = i t +i then @ and (§ yield £ i6CT - 1( ; t) _ + £y 6CT -i (it+l) z* > 2. 

Pairing each small blocker Bt added because of a big-to-small moves with the small blocker Bt+\ 
added because of a small move as above allows us to deduce that \Ais(T)\ < Y^ieMslT) Sj'eo- z j- 
Combining this inequality with ([I]) yields 



iGM i&Mj£a- j&J j<=J 



as required. □ 



We have proved that there is a solution (y*,z*) to the dual that is feasible (Claim 3.8) and 
has negative value (Claim 3.9) assuming there are no potential moves. In other words, the [C-LP] 
cannot be feasible if no potential moves can be chosen which completes the proof of the lemma. 

□ 

3.1.2 Termination 

We continue by proving that Algorithm [T] terminates. As the algorithm only terminates when a 
new job is assigned, Theorem 3.1 follows from Lemma 3. 10| together with Lemma |3 . 7| since then we 



can, as already explained, repeat the procedure until all jobs are assigned. 

The intuition that the procedure terminates, assuming there always is a potential move, is the 
following. As every time the algorithm chooses a potential move that is not valid a new blocker is 
added to the tree and as each machine can be in at most 2\M\ blockers, we have that the algorithm 
must choose a valid move after at most 2|A4| steps. Such a move will perhaps trigger more valid 
moves and each valid move makes a potential move previously blocked more "likely" . We can now 
guarantee progress by measuring the "likeliness" in terms of the lexicographic value of the move. 

Lemma 3.10 Assuming there is always a potential move to choose, Algorithm^ terminates. 

Proof. To prove that the procedure terminates we associate a vector, for each iteration, with the 
dynamic tree T ■ We will then show that the lexicographic order of these vectors decreases. 

The vector associated to T is defined as follows. Let Bo, B\, . . . , B$ be the blockers of T. With 
blocker Bi we will associate the value vector, denoted by Val(-Bj), of the move that was chosen in 
the iteration when Bi was added. The vector associated with T is then simply 

(Val(Bo),Val(Bi) J ...,Val(S < ),oo). 

If the algorithm adds a new blocker then the lexicographic order clearly decreases as the vector 
ends with oo. It remains to verify what happens when blockers are removed from T. In that case 
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let the algorithm run until it chooses a potential move that is not valid or terminates. As blockers 
will be removed in each iteration until it either terminates or chooses a potential move that is not 
valid we will eventually reach one of these cases. If the algorithm terminates we are obviously done. 

Instead, suppose that starting with a and T the algorithm does a sequence of steps where 
blockers are removed until we are left with an updated schedule o~ k , a tree of blockers T k with 
k + 1 < £ blockers, and a potential move (j',i ! ) that is not valid is chosen. As a blocker B is 
removed if a blocker added earlier is removed, we have that T k equals the subtree of T induced by 
Bo, B±, . . . , B k . 

We will thus concentrate on comparing the lexicographic value of with that of B k+ \. 

Recall that Val(i?fc + i) equals the value of the move that was chosen when B k+ i was added, say 
(jt, ik+i) for i t G J(B t ) with 1 < t < k and M(B k+l ) = i k+l . 

A key observation is that since blocker B k+ \ was removed but not B k , the most recent move was 
of a job j k+1 € J(B k+ i) and we have a(j k+1 ) = i k+1 and a k {j k+ i) / i k +i- Moreover, as (j t ,ik+i) 
was a potential move when B k+ \ was added, it is a potential move with respect to T k (using that 
Si k+1 (7fc) has not changed). Using these observations we now show that the lexicographic value 
of (jt,i k +i) has decreased. As the algorithm always chooses the move of minimum lexicographic 
value, this will imply < Val(B k+ i) as required. 

If (jt,ik+i) was a small or big-to-small move then B k+ \ was a small blocker. As no jobs 
were moved to i k+ \ after B k+ i was added, p(c^T 1 (i k +i)) < p(c _1 (ifc+i)) and we have that the 
lexicographic value of (j t ,i k +i) has decreased. 

Otherwise if (jt,ik+i) was a big-to-big move then j k+ i must be a big job. As we only have 
jobs of two sizes, the move of the big job j k+ \ implies that (jt,i k +i) now is a valid move which 
contradicts the assumption that the algorithm has chosen a potential but not valid move (j' , i'). 

We have thus proved that the vector associated to T always decreases. As there are at most 
2|A4| blockers in T (one small and one big blocker for each machine) and a vector associated to a 
blocker can take a finite set of values, we conclude that the algorithm terminates. □ 



4 Proof of Main Result 

In this section we extend the techniques presented in Section [3] to prove our main result, i.e., 
that there is a polynomial time algorithm that estimates the optimal makespan of the restricted 
assignment problem within a factor of || + e ~ 1.9412 + e, where e > is an arbitrarily small 
constant. More specifically, we shall show the following theorem which clearly implies Theorem |1.1| 
The small loss of e is as mentioned because the known polynomial time algorithms only solve [C-LP] 
up to any desired accuracy. 

Theorem 4.1 The [C-LP] has integrality gap at most 



Throughout this section we let R = If. The proof follows closely the proof of Theorem 



3.1 



i.e., 



we design a local search algorithm that returns a solution with makespan at most 1 + R, assuming 
the [C-LP] is feasible. The strategy is again to repeatedly call a procedure, now summarized in 
Algorithm [2j that extends a given partial schedule by assigning a new job while maintaining a valid 
schedule. Recall that a partial schedule is valid if each machine i G M. is assigned at most one big 
job and p(a~ 1 (i)) < 1 + R, where now R = We note that a machine is only assigned at most 
one big job will be a restriction here. 

Since we allow jobs of any sizes we need to define what big and small jobs are. In addition, we 
have medium jobs and partition the big jobs into large and huge jobs. 
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Definition 4.2 A job j is called big if Pj > 11/17, medium if 11/17 > pj > 9/17, and small if 
Pj < 9/17. Let the sets Jb^Jm^Js contain the big, medium, and small jobs, respectively. We will 
also call a big job j huge if pj > 14/17 and otherwise large. 

The job sizes are chosen so as to optimize the achieved approximation ratio with respect to the 
analysis and were obtained by solving a linear program. 

We shall also need to extend and change some of the concepts used in Section [3j The final goal 
is still that the procedure shall choose potential moves so as to guarantee (i) that the procedure 
terminates and that (ii) if no potential move exists then we shall be able to prove that the dual of 
[C-LP] is unbounded. 

The main difficulty compared to the case in Section [3] of two job sizes is the following. A key 
step of the analysis in the case with only small and big jobs was that if small jobs blocked the 
move of a big job then we guaranteed that one of the small jobs on the blocking machine had a 
potential small move. This was to allow us to amortize the load when analyzing the dual. In the 
general case, this might not be possible when a medium job is blocking the move of a huge job. 
Therefore, we need to introduce a new conceptual type of blockers called medium blockers that will 
play a similar role as big blockers but instead of containing a single big job they contain at least 
one medium job that blocks the move of a huge job. Rather unintuitively, we allow for technical 
reasons large but not medium or huge jobs to be moved to machines in medium blockers. 

Let us now point out the modifications needed starting with the tree T of blockers. 

Tree of blockers. Similar to Section [3j Algorithm [2] remembers its history by using the dynamic 
tree T of blockers. As already mentioned, it will now also have medium blockers that play a similar 
role as big blockers but instead of containing a single big job they contain a set of medium jobs. 
We use M.m{T) to refer to the subset of M(T) containing the machines in medium blockers. 

As in the case of two job sizes, Algorithm [2] initializes tree T with the special small blocker as 
root that consists of the job j new . The next step of the procedure is to repeatedly choose valid 
and potential moves until we can eventually assign j new . During its execution, the procedure now 
updates T based on which move that is chosen so that 

1. Ms(T) contains those machines to which the algorithm will not try to move any jobs; 

2. M.b{T) contains those machines to which the algorithm will not try to move any huge, large, 
or medium jobs; 

3. Mm(T) contains those machines to which the algorithm will not try to move any huge or 
medium jobs; 

4. J{T) contains those jobs that the algorithm wishes to move. 

We can see that small jobs are treated as in Section [3j i.e., they can be moved to all machines 
apart from those in A4s(T). Similar to big jobs in that section, huge and medium jobs can only 
be moved to a machine not in _M(7~). The difference lies in how large jobs are treated: they are 
allowed to be moved to machines not in A4 (T) but also to those machines only in A4 u (T) ■ 

Potential moves. As before a potential move (j, i) will be called valid if the update a(j) <— i 
results in a valid schedule, but the definition of potential moves needs to be extended to include 
the different job sizes. The definition for small jobs remains unchanged: a move (j, i) of a small 
job j £ J{T) is a potential small move if i g" A4s(T). A subset of the potential moves of medium 
and large jobs will also be called potential small moves. 
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Definition 4.3 A move of a medium or large job j G <J(T) satisfying i M(T) if j is 

medium and i G" Mb(T) U Ms(T) if j is large is a potential 

small move: if i is not assigned a big job. 

medium/large-to-big: if i is assigned a big job. 

Note that the definition takes into account the convention that large jobs are allowed to be assigned 
to machines not in Mb(T) U Ms(T) whereas medium jobs are only allowed to be assigned to 
machines not in Ai(T). The reason for distinguishing between whether % is assigned a big (huge 
or large) job will become apparent in the analysis. The idea is that if a machine i is not assigned 
a big job then if it blocks a move of a medium or large job then the dual variables z* will satisfy 
Yljeer- 1 (i) z j — 1' which we cannot guarantee if i is assigned a big job since when setting the dual 
variables we will round down the sizes of big jobs which might decrease the sum by as much as 
6/17. 

It remains to define the potential moves of huge jobs. 

Definition 4.4 A move of a huge job j G <J{T) to a machine i G" Ai(T) is a potential 

huge-to-small move: if no big job is assigned to i and p{j) + p{Si U Mj) < 1 + R; 

huge-to-big move: if a big job is assigned to i and p(j) + p(Si U Mi) < 1 + R; 

huge-to-medium move: if p{j) + p{Si) <1 + R and p(j) + p(Si U Mj) > 1 + R; 

where Si = {j G o-~ l (i) : j is small with T a (j) C Ms(T)} and Mi = Jm H a~ l {i). 

Again Si denotes the set of small jobs assigned to i with no potential moves with respect to the 
current tree. The set Mj contains the medium jobs currently assigned to i. Moves huge-to-small 
and huge-to-big correspond to the moves big-to-small and big-to-big in Section [3j respectively. The 
constraint p(j) +p{Si U Mj) < 1 + R says that such a move should only be chosen if it can become 
valid by not moving any medium jobs assigned to i. The additional move called huge-to-medium 
covers the case when moving the medium jobs assigned to i is necessary for the move to become 
valid. 

Similar to before, the behavior of the algorithm depends on the type of the chosen potential 
move. The treatment compared to that in Section [3] of valid moves and potential small moves is 
unchanged; the huge-to-small move is treated as the big-to-small move; and the medium/large-to- 
big and huge-to-big moves are both treated as the big-to-big move were in Section [3} It remains 
to specify what Algorithm [2] does in the case when a potential huge-to-medium move (that is not 
valid) is chosen, say of a job j G J{S) for some blocker B. In that case the algorithm adds 
a medium blocker Bm as child to B that consists of the machine i and the medium jobs assigned 
to i. This prevents other huge or medium jobs to be assigned to i. Also note that constraints 
p(j) + P(Si) < 1 + -R and p(j) + p(Si U Mj) > 1 + R imply that there is at least one medium job 
assigned to i. 

We remark that the rules on how to update T is again so that a job can be in at most one 
blocker whereas a machine can now be in at most three blockers (this can happen if it is first added 
in a medium blocker, then in a big blocker, and finally in a small blocker). 
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Values of moves. As in Section[3] it is important in which order the moves are chosen, Therefore, 
we assign a value vector to each potential move and Algorithm [2] chooses then, in each iteration, 
the move with smallest lexicographic value. 

Definition 4.5 If we let Li = o~~ l (i) then a potential move has value 



Val&i) 



r (0,0) 

(p(j),p(U)) 
(2,0) 

(3,p(£i)) 
(4,0) 



if valid, 

if small move, 

if medium/large-to-big, 

if huge-to-small, 

if huge-to-big, 



(5, \L{ n Jm\) if huge-to-medium. 



Note that as before, the algorithm chooses a valid move if available and a potential small move 
before any other potential move. Moreover, it chooses the potential small move of the smallest job 
available. 



The algorithm. Algorithm [2] summarizes the algorithm concisely using the concepts described 
previously. Given a valid partial schedule a and an unscheduled job j neW i we shall prove that it 
preserves a valid schedule by moving jobs until it can assign j new . Repeating the procedure until 



all jobs are assigned then yields Theorem 4.1 



Algorithm 2 ExtendSchedule(<7,j new ): 
1: Initialize T with the root J{B) = {jnew} and M(B) = _L 
2: while a{j new ) is TBD do 

3: Choose a potential move (J, i) with j G <J(T) of minimum lexicographic value 

4: Let B be the blocker in T such that j £ <J(B) 

5: if (j, i) is valid then 

6: Update the schedule by a(j) •<— i 

7: Update T by removing B and all blockers added after B 

8: else if (j, i) is either a potential small or huge-to-small move then 

9: Add a small blocker B s as child to B with M(B S ) = i and J{B S ) = a' 1 ^) \ J{T)\ 

10: else if (j, i) is either a potential large/medium-to-big or huge-to-big move then 

11: Let jb be the big job such that cf{3b) = i 

12: Add a big blocker B B as a child to B in T with J(B B ) = {]b} and M{B B ) = i 
13: else {(j,i) is a potential huge-to-medium move} 

14: Add a medium blocker Bm as child to B with M.{Bm) = i and J{Bm) = o~~ l {i) H Jm 
15: end if 
16: end while 
17: return a 



4.1 Analysis 

As Algorithm [T] for the case of two job sizes, Algorithm [2] only updates the schedule if a valid move 
is chosen so it follows that the schedule stays valid throughout the execution. It remains to verify 
that the algorithm terminates and that the there always is a potential move to choose. 

The analysis is similar to that of the simpler case but involves more case distinctions. When 
arguing about T we again let 
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• % be the subtree of T induced by the blockers Bq, B%, . . . , Bt] 

• S(Tt) = {j G J ■ j is small with T a (j) C A4s(Tt)} and often refer to S(T) by simply S; and 

• Si(Tt) = a- x {i) n S(Tt) (often refer to Si(T) by Si). 

As in the case of two job sizes, the set S(7l) contains the set of small jobs with no potential moves 
with respect to %■ Therefore no job in S{Tt) has been reassigned since 7t became a subtree of T, 
i.e., since B t was added. We can thus again omit without ambiguity the dependence on a when 
referring to S(7l) and Si(7t). The following observation will again be used throughout the analysis. 
No job in a blocker B of T has been reassigned after B was added since that would have caused 
the algorithm to remove B (and all blockers added after B). 

In Section [4.1.1 we start by presenting the proof that the algorithm always can choose a potential 



move if [C-LP] is feasible. We then present the proof that the algorithm always terminates in 
Section |4.1.2| As already noted, by repeatedly calling Algorithm [2] until all jobs are assigned we 
will, assuming [C-LP] feasible, obtain a schedule of the jobs with makespan at most 1 + R which 



completes the proof of Theorem 4.1 



4.1.1 Existence of potential moves 

We prove that the algorithm never gets stuck if the [C-LP] is feasible. 

Lemma 4.6 // [C-LP] is feasible then Algorithm^ can always pick a potential move. 

Proof. Suppose that the algorithm has reached an iteration where no potential move is available. 



Z 3 



Similar to the proof of Lemma 3.7 we will show that this implies that the dual is unbounded and 
hence the primal is not feasible. We will do so by defining a solution (y*,z*) to the dual with 

T,ieM Vi < T,jeJ z j- Define z * and V* by 

' 11/17, if j € J(T) is big, 

9/17, if j G J(T) is medium, 

Pj, if j G J(J~) U S is small, 

0, otherwise, 

and 

y * fi if i€A* s (n 

\Ejea-Hi) z j otherwise. 

A natural interpretation of z* is that we have rounded down processing times where big jobs with 
processing times in [11/17, 1] are rounded down to 11/17; medium jobs with processing times in 
[9/17, 11/17] are rounded down to 9/17; and small jobs are left unchanged. 

The proof of the lemma is now completed by showing that (y* ,z*) is a feasible solution 



(Claim 4.7) and that the objective value is negative (Claim |478|) . 



Claim 4.7 Assuming no potential moves are available, (y*,z*) is a feasible solution. 

Proof of Claim. We need to verify that y* > X^jeC z j ^ or each j £ and each C G C{i). Recall 
that the total processing time of the jobs in a configuration is at most 1. Also observe as in the 
simpler case that z* = for jobs not in U S and we can thus omit them when verifying the 

constraints. 
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As in the previous case, no constraint involving the variable y* for i G Ms(T) is violated. 
Indeed, for such a machine i we have y* = 1 and ^jeC(i) z j — YljeC(i)Pj — 1 f° r an y configuration 
CeC(i). ' " ' " 

We continue by distinguishing between the remaining cases when i G" M (T) ,i £ Mb (X) \ 
M S (T), and i G M M (T) \ (A^(T) u M B (T)). 

« .A/f (T): A move (j, i) to machine i .A/f (T) is a potential move if j G <J(T) is a small, medium, 
or large job. By assumption we have thus that no such jobs in J{T) have any moves to i. We also 
have, by definition, that no job in S has a move to i G" Ms(T). 

Now consider the final case when a huge job jg G J{T) has a move (jH,i)- Then since it is 
not a potential huge-to-small, huge-to-medium, or huge-to-big move 

pUh) +p(Si) > 1 + R and hence p(Si) > R. 

Since p(jn) > 14/17 but = 11/17 a configuration C G C(i) with j# G C satisfy 

E4 < z *3h + (i-pOh)) < § + (i- ^) = y v W 

which is less than ^ = R < p(Si) < Yljea- 1 U) z j — y*i wnere we used that S% only contains small 
jobs for the second inequality. 

i G Mb(T) \ Ms(T): A move to machine i Ms(T) is a potential move if j G i7(T) is a small 
job. By the assumption that there are no potential moves and by the definition of S, we have thus 
that there is no small job in J{T) U S with a move to i. In other words, all small jobs with z* > 
that can be moved to i are already assigned to i. Now let js be the big job such that o{Jb) = i 
which must exist since machine i is contained in a big blocker. As both medium and big (large and 
huge) jobs have processing time strictly greater than 1/2, a configuration C £C(t) can contain at 
most one such job jo- Since z* Q < 11/17 and z*j g = 11/17 we have that a configuration C G C(i) 
with jo G C cannot violate feasibility. Indeed, 

£*i<4+ E ^4,+ E z >=y*- ( 5 ) 

jec jea- 1 (i)\j B jea-^ii^js 

i G Mm(T) \ (Ms(T) U A / i_e(T)) : By the definition of S and the assumption that there are no 
potential moves, no small or large jobs in J(T) U S have any moves to i. As i is contained in a 
medium blocker, there is a medium job jm such that (t(Jm) = As both medium and huge jobs 
have processing time strictly greater than 1/2, a configuration C G C(i) can contain at most one 
such job jo- 

If jo is medium then z* o < z* M and we have by Inequality ^ (substituting js with jm) that 
the constraint is not violated. Otherwise if jo is huge then from Q (substituting jh with jo) we 
have that Sjec* z j — if ^ or an y configuration C G C[i) with jo G C. 

We shall now show that y* > jk which completes this case. If i is assigned more than one 
medium job then y* > 1. Instead suppose that the only medium job assigned to i is ju- Since the 
medium blocker Bm, with M(Bm) = i and J{Bm) = {]m}, was added to T because the algorithm 
chose a potential huge-to-medium move we have 

p(5 i (r / )) + p(j A f) >r. 
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where T' refer to the tree of blockers at the time when Bm was added. 

As each blocker in T' is also in T , p(Si) > p(Si(T')) and the above inequality yields, 

y* > p(s 1 )+pUm) - (pUm) - z* M ) > R- - > ^, 

as required. 

We have thus verified all the cases which completes the proof of the claim. □ 

Having verified that (y* ,z*) is a feasible solution, the proof is now completed by showing that 
the value of the solution is negative. 

Claim 4.8 We have that Y^ieM Vi < YljeJ z j- 
Proof of Claim. By the definition of y* , 

£%* = £ 1+ £ £ 4 w 

ieM ieM s (T) i^MsCTtjea- 1 ^) 

Similar to the case with two job sizes in Section[3l we proceed by bounding ^ieA^s(T) ^ f rom above 
^ ^2ieM s (T) Yljecr- z j- Let Bq, B\, . . . , Bg oe the blockers of T and consider a small blocker 
B t for some t = l,...,£ with M(B t ) = i t . 

By the definition of the procedure, small blocker Bt has been added in an iteration when either 
a potential small or huge-to-small move (jo, it) was chosen. Let us distinguish between the three 
cases when (jo, it) was a small move of a small job, (jo, it) was a small move of a medium or large 
job, and (jo, it) was a huge-to-small move. 



(jO) U) was a small move of a small job: Since the move was not valid, 

P(jo) +p(<T~ 1 (i t )) >l + R. 

As a big job with processing time in [11/17, 1] is rounded down to 11/17 and a medium job with 
processing time in [9/17,11/17] is rounded down to 9/17, the sum X^eo-- 1 ^) z j wm depend of 
the number of big and medium jobs assigned to it- Indeed, if we let Lj ( = o" _1 (^) n Jb and 
Mi t = o-~ 1 (i t )C\JM be the big and medium jobs assigned to it, respectively. Then on the one hand, 



E. 4>i+*-*o)-(i-£)| i( ,i-(§-|) 



jeo- 1 (i t ) 



which equals 



33 
17 



6 

17 



p(jo)- 



On the other hand, if (i) |Lj t | > 2, (ii) |Lj t | > 1 and |Mj| > 1 or (iii) |Mj t | > 3 then clearly 

20 



E 



.11 9 . 

z* > 1 > 

17 17 ~ 17 



3 



jea 1 (i t ) 

Combining these two bounds we get (since jo is small and thus p(jo) < 9/17) that 



z* > min 

j&cr~ 1 (k) 



27 
17 



P(jc 



20 
17 
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> — if (jo, it) was a small move of a small job. (7) 
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(jo it) was a small move of a medium or large job: Since the move was not valid, we have 
again 

p(jo) +P(<r~ 1 (it)) > 1 + 1?. 

By the definition of potential small moves it must be that there is no big job assigned to it. If there 
are more than one medium job assigned to i then clearly Yljea- z j — !• 

Otherwise, if there is at most one medium job assigned to it that might have been rounded 
down from 11/17 to 9/17 we use the inequality p(jo) +p{(J~ 1 {i t )) > 1 + R to derive 

E 3>i + *-pCjo)-(§-^)=§-p(*>)>i, 

For the final inequality we used that the processing time of medium and large jobs is at most 14/17. 
Summarizing again, we have 

Zj > 1 if (jo, it) was a potential small move of a medium or large job. (8) 

(jo, it) was a huge-to-small move: Similar to the case of two job sizes in Section [3] where we 
considered the big-to-small move, we will show that we can amortize the cost from the blocker Bt+i. 
Indeed, since the move (jo, it) was not valid and by the definition of a potential huge-to-small move 
we have 

p(Ja)+P(<r~\it)) > 1 + R and p(j ) +p (S k (%) U M k ) < 1 + R, (9) 

where we let Mj t contain the medium jobs assigned to it at the time when Bt was added. Recall that 
Si t (7~t) contains those small jobs that have no potential moves with respect to the tree of blockers 
71 after Bt was added. As Bt has not been removed from T both these sets have not changed. In 
particular, since it is not assigned a big job (using the definition of huge-to-small move) we have 
that there must be a small job j' £ a" 1 (it) \ (Si t (Tt) U Mj t ) that has a potential small (j' , i') move 
with respect to Tt- 

The above discussion implies that t < £. Moreover, the value of (j',i') equals (p(j') , p(a~ l (i 1 ))) ■ 
As the procedure always chooses a potential move with minimum lexicographic value we have that 
B t+ \ is a small blocker added because a potential small move was chosen of a small job with 
processing time at most p(j')- 

If we let M(Bt+\) = it+i we have thus by ([7]) 



z* > min 

je<r~ 1 (it+i) 



27 ( ., 20 
17" P(J) '17 



* Tr 



We proceed by showing that we can use this fact to again amortize the cost as done in the simpler 
analysis, i.e., that 

E 4+ E 4> 2 - (ii) 

jeer jeo- 1 (i t +i) 

For this reason, let us distinguish between three subcases depending on the number of medium jobs 
assigned to it- 

it is assigned at least two medium jobs: Since medium jobs have value 9/17 in the dual this clearly 
implies that Yljecr- 1 (it) z j — 18/17 so no amortizing is needed and ( [TT] ) holds in this case. 



18 



it is assigned one medium job: Let jm denote the medium job assigned to it- In addition we have 
that the small job j' is assigned to it. We have thus that Yljea- z j — 9/1? + P(iO- So if 
p(j') > 7/17 then we can amortize because Yljea^Ok i) z j ^ s a * l eas t 18/17. 

Now consider the case when p(j') < 7/17. By the assumption that only one medium job is 
assigned to it and since p(jo) + v{ a ~ l {h)) > 1 + -R we have 

\ z i- R -{vi-Ti) = Tr 



Moreover, as the case when p(j') < 7/17 is considered, we have from (10) that 



V z*> 2 -° 
4- Z 3~ 17' 

jeer 1 (lt+l) 



which implies that (11) is valid also in this case. 



it is not assigned any medium jobs: As it in this case is not assigned any medium or big jobs and 
the huge-to-small move (jo, it) was not valid, 



That we can amortize now follows from (10) that we always have Sjeo— z j — 



18 
17' 



We have thus shown that a blocker Bt added because of a huge-to-small move can be paired 
with the following blocker Bt+i that was added because of a potential small move of a small job. 
As seen above this give us that we can amortize the cost exactly as done in the simpler case and 
we deduce that \A4s{T)\ < 5ZieMs(T) X^jeo— z j- Combining this inequality with ([I]) yields 



as required. □ 



We have proved that there is a solution (y*,z*) to the dual that is feasible (Claim 4.7) and 



has negative value (Claim 4.8) assuming there are no potential moves. In other words, the [C-LP] 



cannot be feasible if no potential moves can be chosen which completes the proof of the lemma. □ 
4.1.2 Termination 

As the algorithm only terminates when a new job is assigned, Theorem |4. 1| follows from the lemma 



below together with Lemma 4.6 since then we can, as already explained, repeat the procedure until 
all jobs are assigned. 

Lemma 4.9 Assuming there is always a potential move to choose, Algorithm^ terminates. 



Proof. As in the proof of Lemma 3.10 , we consider the i'th iteration of the algorithm and associate 
the vector 

CVal(Si), Val(S 2 ), . . . , Val(5<), oo). 
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with T where the blockers are ordered in the order they were added and Val(-Bj) equals the value 
of the move chosen when Bi was added. We shall now prove that the value of the vector associated 
with T decreases no matter the step chosen in the next iteration. 

If a new blocker is added in the % + l'th iteration the lexicographic order clearly decreases as 
the vector ends with oo. It remains to verify what happens when blockers are removed from T. In 
that case let the algorithm run until it chooses a potential move that is not valid or terminates. 
As blockers will be removed in each iteration until it either terminates or chooses a potential move 
that is not valid we will eventually reach one of these cases. If the algorithm terminates we are 
obviously done. 

Instead, suppose that starting with a and T the algorithm does a sequence of steps where 
blockers are removed until we are left with an updated schedule a k , a T k with k + 1 < £ blockers, 
and a potential move that is not valid is chosen. As a blocker B is removed if a blocker 

added earlier is removed, we have that T k equals the subtree of T induced by Bq, B\, . . . , B k . 

We will thus concentrate on comparing the lexicographic value of with that of B k+ \. 

The value of B k+ \ equals of the value of the move chosen when Bk+i was added, say (jt, ijfe+i) for 
jt G J{B t ) with 1 < t < k and M(B k+1 ) = i k+1 . 

A key observation is that since blocker B k+ \ was removed but not B k , the most recent move was 
of a job j k+1 € J(B k+1 ) and we have a(j k+1 ) = i k+1 and a k (jk+i) Moreover, as (j t ,i k +i) 

was a potential move when B k+ i was added, it is a potential move with respect to T k (using that 
Si k+1 {Tk) has not changed). Using these observations we now show that the lexicographic value of 
(jt,ik+i) has decreased. As the algorithm always picks the move of minimum lexicographic value, 
this will imply V&l(j',i') < Vel(Bk+i) as required. 

If (jt,ik+i) was a small or huge-to-small move then B k+ \ was a small blocker. As no jobs were 
moved to i k+ \ after B k+ \ was added, p(<r k (i k +i)) < p(a^ 1 (i k+ i) and we have that the lexicographic 
value of (jt,ik+i) has decreased. 

If (jt,i k +i) was a medium/large-to-big (or huge-to-big move) then j k+ \ must be a big job and 
hence the machine i k+ \ has no longer a big job assigned. This implies that the move (jt,i k +i) is 
now a potential small (or huge-to-small using that no medium jobs are moved to machines in big 
blockers) with smaller lexicographic value. 

Finally, if (j t , ijt+i) was a huge-to-medium move then jk-vx must be a medium job and since no 
medium jobs have potential moves to machines in medium blockers we have that |<7^ 1 (f) n Jm\ < 
\a~ l (i) n Jm\ which implies that the lexicographic value of (jt,ik+i) also decreased in this case. 

We have thus proved that the vector associated to T always decreases. As there are at most 
3|.M| blockers in T (one small, medium, and big blocker for each machine) and a vector associated 
to a blocker can take a finite set of values, we conclude that the algorithm eventually terminates. 
□ 

5 Conclusions 

We have shown that the configuration LP gives a polynomial time computable lower bound on the 
optimal makespan that is strictly better than two. Our techniques are mainly inspired by recent 
developments on the related Santa Claus problem and gives a local search algorithm to also find a 
schedule of the same performance guarantee, but is not known to converge in polynomial time. 

Similar to the Santa Claus problem, this raises the open question whether there is an efficient 
rounding of the configuration LP that matches the bound on the integrality gap (see also [8] for 
a comprehensive discussion on open problems related to the difference between estimation and 
approximation algorithms). Another interesting direction is to improve the upper or lower bound 
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on the integrality gap for the restricted assignment problem: we show that it is no worse than 
33/17 and it is only known to be no better than 1.5 which follows from the NP-hardness result. 
One possibility would be to find a more elegant generalization of the techniques, presented in 
Section [3] for two job sizes, to arbitrary processing times (instead of the exhaustive case distinction 
presented in this paper). 

To obtain a tight analysis, it would be natural to start with the special case of graph balancing 
for which the 1.75- approximation algorithm by Ebenlendr et al. [6] remains the best known. We 
remark that the restriction pij G {pj, 00} is necessary as the integrality gap of the configuration LP 
for the general case is known to be 2 even if a job can be assigned to at most 2 machines [19J. 
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